RAPID: fast and accurate sequence-based prediction of intrinsic disorder content on proteomic scale.

نویسندگان

  • Jing Yan
  • Marcin J Mizianty
  • Paul L Filipow
  • Vladimir N Uversky
  • Lukasz Kurgan
چکیده

Recent research in the protein intrinsic disorder was stimulated by the availability of accurate computational predictors. However, most of these methods are relatively slow, especially considering proteome-scale applications, and were shown to produce relatively large errors when estimating disorder at the protein- (in contrast to residue-) level, which is defined by the fraction/content of disordered residues. To this end, we propose a novel support vector Regression-based Accurate Predictor of Intrinsic Disorder (RAPID). Key advantages of RAPID are speed (prediction of an average-size eukaryotic proteome takes <1h on a modern desktop computer); sophisticated design (multiple, complementary information sources that are aggregated over an input chain are combined using feature selection); and high-quality and robust predictive performance. Empirical tests on two diverse benchmark datasets reveal that RAPID's predictive performance compares favorably to a comprehensive set of state-of-the-art disorder and disorder content predictors. Drawing on high speed and good predictive quality, RAPID was used to perform large-scale characterization of disorder in 200+ fully sequenced eukaryotic proteomes. Our analysis reveals interesting relations of disorder with structural coverage and chain length, and unusual distribution of fully disordered chains. We also performed a comprehensive (using 56000+ annotated chains, which doubles the scope of previous studies) investigation of cellular functions and localizations that are enriched in the disorder in the human proteome. RAPID, which allows for batch (proteome-wide) predictions, is available as a web server at http://biomine.ece.ualberta.ca/RAPID/.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Quantitative proteome-based guidelines for intrinsic disorder characterization.

Intrinsically disordered proteins fail to adopt a stable three-dimensional structure under physiological conditions. It is now understood that many disordered proteins are not dysfunctional, but instead engage in numerous cellular processes, including signaling and regulation. Disorder characterization from amino acid sequence relies on computational disorder prediction algorithms. While numero...

متن کامل

Markovian Delay Prediction-Based Control of Networked Systems

A new Markov-based method for real time prediction of network transmission time delays is introduced. The method considers a Multi-Layer Perceptron (MLP) neural model for the transmission network, where the number of neurons in the input layer is minimized so that the required calculations are reduced and the method can be implemented in the real-time. For this purpose, the Markov process order...

متن کامل

Prediction of Students' Tendency to Addiction based on Religious Orientation and Self-Differentiation

Addiction for narcotics is a dangerous reality and is one of the most important socioeconomic and health problems, threatens the human society and leads to social stagnancy in various aspects. The purpose of this study was to investigate the role of religious orientation and self-differentiation in Students' tendency to addiction. The research method was descriptive correlational. The “tendency...

متن کامل

Link Prediction using Network Embedding based on Global Similarity

Background: The link prediction issue is one of the most widely used problems in complex network analysis. Link prediction requires knowing the background of previous link connections and combining them with available information. The link prediction local approaches with node structure objectives are fast in case of speed but are not accurate enough. On the other hand, the global link predicti...

متن کامل

Designing a Computerized Neuro-Cognitive Program for Early Diagnosing Children at Risk for Dyslexia

Objectives: The aim of this research is to design a neuro-cognitive program, based on dysfunctions and alterations of some neural circuits in dyslexics. The visual and auditory working memories in pre-schoolers were evaluated with this program in order to early screening for dyslexia. Methods: This study is a longitudinal descriptive research. A total of 259 randomly selected pre-schoolers, wi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Biochimica et biophysica acta

دوره 1834 8  شماره 

صفحات  -

تاریخ انتشار 2013